Estimating entanglement measures in experiments 
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We present a method to estimate entanglement measures in experiments. We show how a lower 
bound on a generic entanglement measure can be derived from the measured expectation values of 
any finite collection of entanglement witnesses. Hence witness measurements are given a quantitative 
meaning without the need of further experimental data. We apply our results to a recent multi- 
photon experiment [M. Bourennane et al, Phys. Rev. Lett. 92, 087902 (2004)], giving bounds on 
the entanglement of formation and the geometric measure of entanglement in this experiment. 
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Introduction — Deciding whether or not a given state 
is entangled is one of the basic tasks of quantum infor- 
mation theory. In principle, one can determine the full 
quantum state via state tomography, and apply some sep- 
arability criteria afterwards. However, the tomography 
requires an effort which is growing exponentially with the 
number of parties. For practical implementations, it is 
therefore highly desirable to verify entanglement on the 
basis of only a few, maybe only one measurement. En- 
tanglement witnesses [1| are just the observables for this 
purpose: by definition they have positive expectation on 
every separable state, so when a negative expectation is 
found in some state, it must be entangled. Consequently, 
entanglement witnesses have been used in many experi- 
ments [1, 0, H| , and their theory is far developed 0, @, 0] ■ 

Besides the mere detection, the quantification of entan- 
glement is an even more challenging problem in the field. 
Here one aims at characterizing the amount of entangle- 
ment by so-called entanglement measures. Many entan- 
glement measures have been introduced for this purpose 
[8(. But even if a quantum state is fully known, the 
computation of given entanglement measure is often not 
straightforward. Needless to say that the efficient deter- 
mination of an entanglement measure in experiments is 
even more complicated. 

In this paper we present a method to estimate en- 
tanglement measures in experiments. We show that en- 
tanglement witnesses cannot only be used for the detec- 
tion of entanglement, but also for its quantification: any 
measured negative expectation value of a witness can be 
turned into a nontrivial lower bound on a generic entan- 
glement measure. Hence, if witnesses are already used for 
entanglement detection, the estimation of an entangle- 
ment measure requires no extra experimental effort. We 
describe the procedures for computing such bounds in de- 
tail for entanglement of formation [9( and the geometric 
measure of entanglement (Io| . Our method can not only 
be applied to the measurement of a single witness, but 
extends to incomplete tomography in general: for any fi- 
nite set of measured expectation values we characterize 



the best possible lower bound on any convex entangle- 
ment measure (or, more generally, any convex figure of 
merit) consistent with these expectations. Finally, we ap- 
ply our results to a recent multi-photon experiment [3( . 

The theoretical context of our method is the theory 
of Legendre transforms (also called Fenchel transforms 
or conjugate functions) This method has already 

been used to characterize additivity properties of entan- 
glement measures (l2| . The question how to estimate the 
entanglement when only partial knowledge is given was, 
to our knowledge, first addressed in Ref. [13]. Bounds on 
some entanglement measures from special Bell inequal- 
ities or entanglement witnesses have been obtained in 
Refs. @, 03] , and methods to estimate measures in ex- 
periments by making measurements on several copies of 
a state have been discussed in Ref. [Hj]. While finishing 
this paper, we learned that similar ideas and conclusions, 
illustrated with a discussion of a complementary choice of 
entanglement measures, are reached in a paper by Eiscrt, 
Brandao and Audenaert flit ]. 

Main idea of the estimation — Let us consider n wit- 
ness operators (or indeed any hermitian operators [i"?l]). 
Wi, . . . , W„ on the same Hilbert space, and some entan- 
glement measure E, assigning to every density operator p 
a numerical value E(p) characterizing its entanglement. 
We assume for the moment only that p i— > E(p) is con- 
vex and continuous. Suppose now that, for some state 
p, we have measured the expectations of the Wk, be., 
that we are given the real numbers wu = tr(p~Wk) for 
k = l,...,n. On the basis of these numbers we would 
like to calculate a lower bound on E(p) or, more precisely, 
the best lower bound 



e(wi, 



M{E(p)\tr(pW k ) 

p 



w k }, (1) 



where the infimum is understood as the infimum over all 
states compatible with the data Wk = tr(pWfc). 

The idea of our estimate is to characterize a convex 
function such as e : K™ — ► K or the entanglement mea- 
sure E itself as the supremum of all affine (i.e., lin- 
car+constant) functions below it. So let r = . . . , r„) 
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and w = (wi, . . . , w n ) be vectors, which wc use to define 
the linear function w i— > r • = ^2 k r k vj k , and consider 
bounds of the type 

e(i^) > r ■ u; — c (2) 

for arbitrary r and c. Note that by definition of e this 
is the same as saying that E(p) > r ■ w — c for every p 
giving the expectation values w k as in (JT|) . The constant 
c, which we try to choose as small as possible, hence needs 
to satisfy, for any p, the inequality 

c>5> fc tr( P W fe ) -E(p), (3) 
k 

where we already inserted the condition Wk = tr(pWfc). 
Obviously, the best choice of c is the supremum of the 
right hand side, which only depends on the operator W = 
y\ rfeWfe. Hence we can write 

c = E(j2r k W k ^) with (4) 

fe 

E(W) = sup{tr{pW)-E(p)}. (5) 

p 

Here (O is just the definition of E as the Legendre trans- 
form of the entanglement measure E. We now use the 
formula of the optimal constant c in @ to compute 
e. As a convex function it is the supremum over all affinc 
functions below it, which arc now parameterized by the 
"slopes" r (sec also Fig.Q]). Hence we arrive at the main 
formula of this paper, characterizing the lower bound on 
E, obtainable from the measured expectations w k : 

e{w)=sup{r-w-E(J2 k r k W k )}- (6) 

Once again this is a Legendre transform formula, saying 
that e is the Legendre transform of e(r) = E(J2 k r k W k ) ■ 
Of course, we want to apply formula ([6]) mainly when 
n = 1, or at least, when n is very small compared to 
the dimension of the full space of hermitian operators. 
It does involve the computation of two Legendre trans- 
forms: on the one hand, we have to compute E from |5|). 
For any choice of coefficients (n, . . . , r n ) the computa- 
tion of c = E(y~] k rfcWfe) already gives a partial solution 
to our problem of giving a lower bound on E(p) in terms 
of the measured expectations, namely a best linear lower 
bound of the form ((2|). Optimizing over r then gives 
the best overall lower bound (J6j) for which the Legendre 
transform has to be taken over a low (i.e., n-) dimen- 
sional space only (see Fig [I]). In any case the success of 
the method depends on the possibility of efficiently com- 
puting E. Clearly this will depend on the entanglement 
measure E and the witness W chosen. We will demon- 
strate now for important examples how the computation 
can be done. 

Convex roof constructions — Many entanglement mea- 
sures are defined by a standard extension process, which 




FIG. 1: A schematic view of the estimation method. In order 
to estimate the convex function e(w) we consider linear affine 
functions below it. The dotted line corresponds to a general 
estimate as in Eq. ((2j), the dashed line to an estimate with the 
same slope r, but the smallest possible c. This gives already 
the bound e(wo) > B — r ■ wo — c. By varying the slope r 
one arrives at the dashed-dotted line, which gives the best 
possible bound e(u>a). 

extends a function i— > E(\ip}) defined only on pure 
entangled states to all mixed states, namely as 

E{p)= inf J>£(|^)), (7) 

where the Pi are convex weights, and X^PilV'iKV'il = P- 
The convex roof (or "convex hull") is just the largest 
convex function smaller than E on pure states, and can 
therefore be computed as the supremum of its affine lower 
bounds, i.e., once again as a Legendre transform. E 
can then be simplified to a variational problem over pure 
states only: 

E(W) = supltr(pW)- inf Vp,£(|^))) 

p Pi, I'M J 

= Bupsup{y>{(^|W|Vi) -EQM)}} 
= bu P {MW>-.E(M)}. (8) 

Here, at the second equality, we converted the "— inf" 
into a "sup" and substituted p from the constraint p = 
SjPilV'i)(V't|- The constraint then becomes redundant, 
because the sup is taken over all values p, too. The sup 
over the pt can furthermore be dropped, because convex 
combinations of expressions of the form (|8|) cannot be 
larger than the largest of these values. So in the end 
we can use the Legendre formula ([5]) for E, with the 
simplification that we need only vary over pure states. 

In many cases the variation can be simplified by vary- 
ing first over orbits of the local unitary group, i.e., to 
consider vectors \ip) = (Ui ® t/a ) | <^>) with XJ\,XJ-x unitary 
matrices and \cf>) fixed. Since, by definition, entanglement 
measures are invariant under such transformations, the 
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second term in ([8]) is independent of the Ui, so we can 
maximize the first term separately. 

Consider, for example witness operators of the form 
W = at — |x)(xli which is a typical form of witnesses. 
Then we have to maximize |(xl(t^i ® U 2 )<l>)\- It is easy 
to see that this maximum is attained, when \x) and 
(Ui (g) U2)\(j>) have the same Schmidt basis, and the 
Schmidt coefficients are ordered in the same way (for a 
detailed proof see the appendix of Ref. 0]). Hence for 
a system composed of two d-dimensional ones, we only 
need to vary over d positive numbers with one normal- 
ization constraint (rather than d 2 complex amplitudes 
in |V>})- In the examples shown below this reduces the 
computation to a simple one parameter optimization. 

Entanglement of formation — The entanglement of 
formation Ep is defined as the convex roof of the function 
Ef(\'4 > )) = S (tr2(\ip)(ip\)) , the von Neumann entropy of 
the reduced state [9(]. It is one of the natural figures 
of merit for experimental achievements in state prepara- 
tion, because it quantifies the entanglement (measured 
in singlet pairs) which must be invested per realization 
of the state. In contrast, measures like the distillable 
entanglement tell us about the potential further uses of 
the state, which may be quite low, even if the state is 
entanglement-expensive to make. 

For small dimensions the direct computation of Ep 
along the lines described above is feasible. For higher 
dimensions it is convenient to solve ([8]) by an uphill iter- 
ation, which will find a maximum efficiently. 

To this end we rewrite the entropy term by the Gibbs 
variational principle, i.e., as the Legendre transform of 
the free energy F from statistical mechanics: 

S(p) = M{tT P H - F(H)} = -trp In p, (9) 
F(H) = mi{tr pH - S{p)} = -lntr{e- H ). (10) 

Here the first infimum is over all hermitian operators H, 
and the second is over all density operators p. The first 
infimum is attained for H = — In p, and the second one 
for p = cxp(— if)/tr(cxp(— H)). We followed the conven- 
tions from statistical mechanics by using natural loga- 
rithms, but have set the inverse temperature (3=1 [la ]. 
Inserting |9]) into the entanglement term in ([8]) we get 

E F (W) = supsup{(V>|(W -H® + F(H)}, (11) 

|V>> H 

where the first supremum is over all unit vectors of the 
bipartite system, and the second over all hermitian H of 
the first system. The point of this way of writing Ep is 
that the suprema over these two variables obviously com- 
mute, and that when one of them is fixed, the supremum 
(in fact, the absolute maximum) over the other variable 
can be computed directly (without a search algorithm). 
Indeed, for fixed H (|lip requires to be an eigenvector 
for the largest eigenvalue of (W — H <E> 1). On the other 



hand, when \ip) is fixed, the variation is exactly (J9]) for 
the reduced density operator p\ of which we know to 
be attained at H = — In p\ . Hence by alternating these 
steps, we gain in every step, and get convergence to a 
local maximum. In the cases we have tried, the local 
maximum was always independent of the starting point, 
giving strong support to the claim of having found the 
global maximum. Therefore the algorithm is a useful tool 
for finding the maximum. However, a guarantee cannot 
be given in this algorithm, so in principle the resulting 
entanglement lower bound © could be too optimistic. 

Geometric measure of entanglement — This measure is 
an entanglement monotone for multipartite systems [Icl ] , 
defined via the convex roof construction and 

£ G (|V)) = 1- sup |(^)| 2 (12) 

|0)=|o)|6)|c>... 

as one minus the maximal squared overlap with the fully 
separable states. For pure states, the geometric measure 
is a lower bound on the relative entropy and one can 
derive from it an upper bound on the number of states 
which can be discriminated perfectly by local operations 
and classical communication [l9[. We have then 

£ G (W)=sup sup {(i>\(W + \0}(<f>\M - I}. 

W |tf>=|o)|6)|c>... 

(13) 

To show how this optimization can be performed, let us 
assume for simplicity that we have three parties, i. e., 
\cf)) = \abc). If | a), | b) and |c) are fixed, we can perform 
the optimization by taking as an eigenvector corre- 
sponding to the maximal eigenvalue. If we fix 1-0) and 
two of the other vectors, e.g., \b) and |c), we have to 
find a vector \a) such that sup| a ^ K^aftc)! 2 =: | (ip\dbc) \ 2 . 
If the Schmidt decomposition of \ip) with respect to the 
A\BC partition is given by \tp) = J2i s i\ r lt)\ r lF C ) ^ we 
have | (V> | abc) \ = | £V Si{rjf \ a) (r]f G \ bc}\. This scalar prod- 
uct is maximal if the vectors are parallel. So we set 

\d)=N-J2^(^ C \bc)\ V f), (14) 

3 

where Af denotes a normalization. So this optimization 
can be iterated, as in the case of the entanglement of 
formation. Note that a similar iteration also delivers a 
method to calculate the geometric measure Ec(\tl')) for 
arbitrary pure states 

For special cases of witnesses, the Legendre transform 
can even be calculated analytically. Let us assume that 
the witness is of the form rW = r(al — |x)(xl)- Here, 
we have already inserted the r as it is used in Eq. |6|. 
If r > 0, we choose in Eq. (fT3"|) \<f>) orthogonal to |x), 
resulting in E(rW) = ra. If r < 0, one can directly 
verify that we have to choose \cf>) as the state with the 
largest overlap with |x), which results in 

E G {rW) = ^ + '(1 - r) 2 + *rE G (\x)) + ra - 1. 

(15) 
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Hence Eq can be computed, provided Eq(\x)) is known. 

Application to the experiment — The experiment in 
Ref. [3| aimed at the production of the W-state 



\W) 
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(|001) + |010) + |100)). 



(16) 



For the entanglement verification, two witnesses have 
been used. The witnesses and their mean values were 
given by Q 



have explicitly demonstrated the calculations for the en- 
tanglement of formation and the geometric measure of 
entanglement. Finally, we applied our results to experi- 
mental data, gaining new insights into already performed 
experiments. Identifying witnesses, which are not only 
capable to detect entanglement in noisy situations but 
deliver at the same time good estimates of entanglement 
measures is an interesting task for further study. 



\W)(W\, (Wi) 



-0.197 ±0.018, 
-0.139 ±0.030, 



where \ip wz ) = (\y + y + y + ) - \y~y~y~))/V% = 
i(VS\W) - |Hl))/2 is a GHZ type state. 

For the entanglement of formation, we consider the 
A|BC-bipartition, because of the symmetry the other 
bipartitions are equivalent. If we apply our theory on 
witnesses VVi and W2 separately, we get the bounds 
E [ p\p) > 0.308 ± 0.051 from Wi and E { p\p) > 0.140 ± 
0.051 from V\?2- If we use both witnesses at the same time, 
we get the bound 



E { f' 2 \p) > 0.309 ±0.050. 



(17) 



For the geometric measure, using Eq. (|15|) and the fact 
that E G {\W)) = 5/9 and E G (\^ Gli7 ')) = 1/2 we 
get the bounds E^ } (p) > 0.199 ± 0.022 from Wi and 

E ( g\p) > 0.019 ± 0.010 from W 2 . Using both witnesses 
simultaneously, we obtain the bound 



E { g' 2) (p) > 0.209 ±0.023. 



(18) 



The fact that the bounds from Wi are better than the 
ones obtained from W2 stems from the fact that W\ is by 
construction sensitive for detecting the W-state. If the 
W-state were produced perfectly, then the bound from 
Wi would give the exact value, since only the W state 
is compatible with (Wi) = —1/3. Naturally, the bounds 
using both witnesses are always better than the bound of 
the single witnesses alone, since more information on the 
state is available. In principle, one may still improve the 
bound by including all the measured coincidence proba- 
bilities from Ref. [3j . 

Along the same lines one can also investigate other 
experiments, where witnesses have been used [111- In 
the exceptional cases where complete state tomography 
has been done [if, one may, of course, also try other 
estimation methods. Then it would be of great interest 
to compare these methods with our proposed one. 

Conclusion — We proposed a method to estimate 
entanglement measures in experiments. To do so, we 
showed how entanglement witnesses can be used to ob- 
tain lower bounds on generic entanglement measures. We 
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